Rapid speaker adaptation by reference model interpolation
نویسندگان
چکیده
We present in this work a novel algorithm for fast speaker adaptation using only small amounts of adaptation data. It is motivated by the fact that a set of representative speakers can provide a priori knowledge to guide the estimation of a new speaker in the speaker-space. The proposed algorithm enables an a posteriori selection of reference models in the speakerspace as opposed to the a priori selection of reference speaker-space commonly used in techniques such as Eigenvoices. We compare the proposed algorithm with the common rapid adaptation techniques within the context of phoneme recognition task. Experimental results on the IDIOLOGOS and PAIDIALOGOS corpus [1] show that the proposed algorithm achieves slightly better improvement than classic Eigenvoices in phoneme accuracy rate, especially for atypical speakers such as children.
منابع مشابه
Rapid Speaker Adaptation With Speaker Clustering
This thesis addresses the problem of rapid speaker adaptation. This is the task of altering the parameters of a speaker dependent speech recognition system so as to make that system look more like a speaker dependent system using a very small amount (<10 seconds) of speaker specific data. The approach to speaker adaptation taken in this work is called speaker cluster weighting (SCW). SCW extend...
متن کاملRapid adaptation using penalized-likelihood methods
In this paper, we introduce new rapid adaptation techniques that extend and improve two successful methods previously introduced, cluster weighting (CW) and MAPLR. First, we introduce a new adaptation scheme called CWB which extends the cluster weighting adaptation method by including a bias term and a reference speaker model. CWB is shown to improve the adaptation performance as compared to CW...
متن کاملImproved Bayesian learning of hidden Markov models for speaker adaptation
We propose an improved maximum a posteriori (MAP) learning algorithm of continuous-density hidden Markov model (CDHMM) parameters for speaker adaptation. The algorithm is developed by sequentially combining three adaptation approaches. First, the clusters of speaker-independent HMM parameters are locally transformed through a group of transformation functions. Then, the transformed HMM paramete...
متن کاملCluster adaptive training for speech recognition
When performing speaker adaptation there are two conicting requirements. First the transform must be powerful enough to represent the speaker. Second the transform must be quickly and easily estimated for any particular speaker. Recently the most popular adaptation schemes have used many parameters to adapt the models. This limits how rapidly the models may be adapted. This paper examines an ad...
متن کاملReconstructing voices within the multiple-average-voice-model framework
Personalisation of voice output communication aids (VOCAs) allows to preserve the vocal identity of people suffering from speech disorders. This can be achieved by the adaptation of HMM-based speech synthesis systems using a small amount of adaptation data. When the voice has begun to deteriorate, reconstruction is still possible in the statistical domain by correcting the parameters of the mod...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007